Equivalence, Containment and Rewriting of Aggregate Queries

Author

  • Sara Cohen
Abstract

The primary goal of this thesis is to lay the theoretical foundations for a formal study of aggregate query optimization. This requires a coherent understanding of equivalences and containments between aggregate queries of varied forms. A secondary goal of this thesis is to solve the view usability problem for varied types of aggregate queries. The view usability problem is that of determining whether a query can be reformulated to use other queries instead of base relations. This is important in query optimization, since it enables reuse of previously computed results. It is also a necessity when integrating information from heterogeneous sources, when each source is described as a view over a global schema. To achieve the primary goal, the general problem of determining equivalences among aggregate queries is considered. Our aggregate queries may contain disjunctions, comparisons and negated subgoals. First, the bounded equivalence problem, i.e., the problem of determining equivalence of queries with respect to databases whose size is bounded by a constant, is studied. A complete characterization of when bounded equivalence is decidable is presented. It is shown how to determine equivalence by a reduction to local equivalence (a special case of bounded equivalence). Characterizations of equivalence for the special case of quasilinear queries are presented. Equivalence of quasilinear queries can be determined in polynomial time. Containment of aggregate queries is reduced to equivalence of aggregate queries. All the characterizations are couched in terms of abstract characterizations of aggregation functions. Thus, the results presented are easily extendible to additional aggregation functions. These results also solve additional open problems, including


Similar resources

Algorithms for Rewriting Aggregate Queries Using Views

Queries involving aggregation are typical in database applications. One of the main ideas to optimize the execution of an aggregate query is to reuse results of previously answered queries. This leads to the problem of rewriting aggregate queries using views. Due to a lack of theory, algorithms for this problem were rather ad-hoc. They were sound, but were not proven to be complete. Recently we...


Algorithm for Term Linearizations of Aggregate Queries with Comparisons

We consider the problem of rewriting queries based exclusively on views. Both queries and views can contain aggregate functions and include arithmetic comparisons. To study the equivalence of a query with its rewriting query, the so-called "linearizations of a query" need to be computed. To find the linearizations of a query, the linearizations of terms from the query need to be generated. We p...


Equivalence of Queries with Nested Aggregation

Query equivalence is a fundamental problem within database theory. The correctness of all forms of logical query rewriting—join minimization, view flattening, rewriting over materialized views, various semantic optimizations that exploit schema dependencies, federated query processing and other forms of data integration—requires proving that the final executed query is equivalent to the origina...


The complexity of higher-order queries

Higher-order transformations are ubiquitous within data management. In relational databases, higher-order queries appear in numerous aspects including query rewriting and query specification. This work investigates languages that combine higher-order transformations with ordinary relational database query languages. We study the two most basic computational problems associated with these query ...



Publication date: 2004